USC-TIMIT: A database of multimodal speech production data
نویسندگان
چکیده
USC-TIMIT is a speech production database under ongoing development, which currently includes real-time magnetic resonance imaging data from five male and five female speakers of American English, and electromagnetic articulography data from five of these speakers. The two modalities were recorded in two independent sessions while the subjects produced the same 460 sentence corpus. In both cases acoustics were recorded in parallel with the articulatory data, and phonemically transcribed. The database, and companion techniques for reconstruction, processing and linguistic analysis, are freely available to the research community.
منابع مشابه
Real-time magnetic resonance imaging and electromagnetic articulography database for speech production research (TC).
USC-TIMIT is an extensive database of multimodal speech production data, developed to complement existing resources available to the speech research community and with the intention of being continuously refined and augmented. The database currently includes real-time magnetic resonance imaging data from five male and five female speakers of American English. Electromagnetic articulography data...
متن کاملA Multimodal Real-Time MRI Articulatory Corpus for Speech Research
We present MRI-TIMIT: a large-scale database of synchronized audio and real-time magnetic resonance imaging (rtMRI) data for speech research. The database currently consists of speech data acquired from two male and two female speakers of American English. Subjects’ upper airways were imaged in the midsagittal plane while reading the same 460 sentence corpus used in the MOCHA-TIMIT corpus [1]. ...
متن کاملInvestigation of Speed-Accuracy Tradeoffs in Speech Production Using Real-Time Magnetic Resonance Imaging
Motor actions in speech production are both rapid and highly dexterous, even though speed and accuracy are often thought to conflict. Fitts’ law has served as a rigorous formulation of the fundamental speed-accuracy tradeoff in other domains of human motor action, but has not been directly examined with respect to speech production. This paper examines Fitts’ law in speech articulation kinemati...
متن کاملThe USC CreativeIT database of multimodal dyadic interactions: from speech and full body motion capture to continuous emotional annotations
Improvised acting is a viable technique to study expressive human communication and to shed light into actors’ creativity. The USC CreativeIT database provides a novel, freely-available multimodal resource for the study of theatrical improvisation and rich expressive human behavior (speech and body language) in dyadic interactions. The theoretical design of the database is based on the well-est...
متن کاملپایهگذاری بستری نو و کارآمد در حوزه بازشناسی گفتار فارسی
Although researches in the field of Persian speech recognition claim a thirty-year-old history in Iran which has achieved considerable progresses, due to the lack of well-defined experimental framework, outcomes from many of these researches are not comparable to each other and their accurate assessment won’t be possible. The experimental framework includes ASR toolkit and speech database ...
متن کامل